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USE OF GRAM-POSmVE BACTERIA 
TO EXPRESS RECOMBINANT PROTEINS 

BACKGROUND OF THE INVENTION 



1. Field of the Invention 

The present invention relates to the construction and use of a novel gram- 
posmve expression vector system. Protein fusions containing the amino and 
carboxy sorting sequences of a gram-positive surface polypeptide can be 
anchored to the surface of a heterologous host. Alternatively, recombinant 
proteins may be secreted into the growth medium. This system can be used for 
overproducing and purifying recombinant proteins or peptides for any purpose, 
including but not limited to commercial scale production of diagnostic and 
vaccine antigens and therapeutic proteins, such as hormones and growth factors. 



2. Description of the Related Art 

The ability to overproduce many prokaryotic and eukar>^otic proteins has 

15 been made possible through the use of recombinant DNA technology. The 
introduction of chimeric DNA molecules into Escherichia coh has been the 
method of choice to express a variety of gene products. The main impetus 
behind the use of £. co/z-based protein production systems is the host's shon 
generation time and well-developed genetics. Yet despite the development of 

20 many efficient £. co//-based gene expression systems in recent years, the most 

important concern continues to be that associated with downstream processing of 
the product. Recombinant proteins produced in E. coli do not readily cross the 
outer cell membrane (OM); as a result, polypeptides must be purified from the 
cytoplasm or periplasmic space (PS). Purification of proteins from these cellular 

25 compartments can be somewhat difficult. Frequently encountered problems 

include low product yields, contamination with potentially toxic cellular matenal 
(i.e., endotoxin) and the formation of large amounts of partially folded 
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polypeptide chains in non-active aggregates, called inclusion bodies. As a result 
of these inherent purification difficulties, a great deal of attention has recently 
been focused on the natural ability of many gram-positive bactena to export 
proteins beyond their cell wall boundaries. In spite of the fact that the genetics 
of gram-positive bacteria are not as well-elucidated as those of E. coli, these 
microorganisms have, nevertheless, become recognized as more favorable 
candidates as hosts for the production of recombinant proteins. 

Proteins exported across the gram-positive cytoplasmic membrane (CM) 
generally have two fates: they are either released (secreted) into the extracellular 
milieu or they remain anchored to the ceD wall/membrane. Many of the recently 
developed gram-positive based expression systems have reHed exclusively on the 
former route of protein export (i.e., secretion). The basic strategy for directing 
the secretion of proteins in gram-positive expression systems involves fusing 
target proteins with functional N-terminal signal sequences of the gram-positive 
secretion systems currently available. By far. the most popular hosts belong to 
the genus Bacillus. This group of microorganisms has been used by industry for 
the production of a variety of economically important protems (Priest, Bacteriol. 
Rev., 41:711 (1977); Glenn, Ann. Rev. Microbiol., 30:41 (1976)). Perhaps the 
most extensively studied gram-positive host-vector expression system is based on 
the B. amyloliquefaciens alpha-amylase gene (Ulmanen et al. , J. Bacteriol. 
162:176 (1985); Palva et al. , Proc. Natl. Acad. Sci. U.S.A., 79:5582 (1982); 
Palva et al., Gene, 22:229 (1983); Lundstrom et al. , Virus Res., 2:69 (1985)). 
However, expression systems based on other gram-positive hosts such as B. 
subtilis. B. coagulans, B. licheniformis (U.S. Patents 4,824,782, 5,171,673, 
4,711,843; WO patem 8,605,812; Chang, S., Methods in Enzymology, 153:507- 
516 (1987)), Lactococcus and Lactobacillus spp. (U.S. Patent 5,242,821) 
Staphylococcus spp. (Abrahmsen et al., EMBO J. 4:3901-3906 (1986)), 
Streptomyces spp. (U.S. Patent 4,745,056; JP Patent 2,002,379; EP 352,707) 
and Corynebacterium spp. (U.S. Patem 4,965,197; WO Patent 9.303,158) have 
also been developed for general use. 
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Gram-posiuve secreuon systems provide several advantages over 
traditional E. coli based expression systems. One obvious benefit is that protems 
exponed beyond the cell wall usually retain their nauve conformaaon. As a 
result, one can take full advantage of established purification protocols which are 
based on the functional properties of the active protem. In addition, gram- 
positive expression systems usually generate higher protein yields which are 
generally free of potentially toxic contaminating cellular material. 

There are, however, several practical hmitations to the punficauon of 
extracellular protems. The first concern is that of protein instability. 
Recombinant proteins secreted by many gram-positive hosts are extremely 
sensitive to proteolytic degradation by host-encoded extracellular proteases. The 
presence of such proteases can drastically affect protein yield. Fortunately, 
significant improvements have been achieved in maintaining the viability of 
secreted proteins by geneticaUy modifying host strains with reduced extracellular 
protease activities (Kawamura et al, J. BacterioL, 160:442 (1984); U.S. Patent 
5,084,383). 

Another important point to consider when using gram-positive secretion 
systems is the requirement for extensive downstream processmg. It is wddely 
accepted that the purification of recombinant proteins from bullcy, large volume 
fermentations is an extremely time consuming and costly proposition, both in 
terms of equipment and manpower. In those cases where the use of a gram- 
positive secretion system is simply not practical, it is common to resort to a less 
expensive, gram-negative based expression system. The decision to use E, coli 
as an alternative expression host, despite the potential pitfalls, is made simply on 
the basis that recombinant polypeptides remain associated with the bacterial cell 
(i.e., intracellular or periplasmic) and hence are generally easier to purify. 
Therefore, it is clear that what is needed in this art is a bactenal expression 
system that incorporates the salient features of both gram-posmve (i.e., protein 
secretion) and gram-negative (i.e., protein compartmentalization) systems. A 
novel, but simple alternative to the currently available bactenal expression 
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systems would be the development of a gram-positive system that is able to 
specifically anchor or attach recombmant proteins directly to the cell wall 
surface. In this way the anchoring process can become an integral part of the 
purification process. For example, cells harboring a recombinant protein 
attached to the cell surface would be washed, coUected, resuspended in a small 
volume and then treated with a specific agent to affect the release of the desired 
protein. Following removal of the bacterial cells, the resulting supernatant fluid 
would be highly enriched for the protein product. 

The cell wall of gram-positive bacteria is a complex organelle, which is 
assembled from peptidoglycans, carbohydrates, and proteins with different 
biological properties. Surface proteins differ from naturally secreted products in 
that the former require specific sorting signals that presumably allow them to 
deviate from the normal default pathway of protein export. Despite the fact that 
many of the biologically important gram-positive proteins, including the 
streptococcal M protein, and the protein A and fibronectin binding proteins of S. 
aureus, are anchored at the ceU surface, the cell wall of gram-positive bacteria 
remains a relatively unknown cellular compartment. 

While the normal N-terminal export signals that serve as the basis for 
bacterial secretion systems have been extensively characterized (Abrahmsen et 
al., EMBO J., 4:3901-3906 (1985); Heijne and Abrahmsen, FEES Letters, 
244:439-446 (1989); Pugsley, Microbio. Reviews, 57:50-108 (1993)), the ex- 
acting sorting signals that orchestrate the anchoring of ceUular proteins to the 
gram-positive cell wall have been only recently elucidated. Sequence alignment 
of well over 50 gram-positive surface proteins to date (excluding those from B. 
subtilis and S. pneumoniae) has revealed the existence of both amino- and 
carboxy-terminal secretion/anchoring signals. As expected, all of the proteins 
examined contain an N-terminal signal sequence that presumably directs them to 
the cellular export machiner>'. The C-termini of these proteins contain a 
predominantly hydrophobic, potential membrane spanning region that is 
immediately followed by a charged tail (Fischetti ei al, Mol. Microbiol, 4:1603- 
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1605; Figure 1: Darkly shaded region = LP(X)TGrX) consensus sequence; 
lightly shaded region = carboxy-termmal hydrophobic, potential membrane 
spanning region; residues m bold = charged protein tail; Protein A = 
Staphylococcus aureus protein A; M6 - Streptococcus pyogenes M6 protein; 
WapA = 5. mutans wall-associated protein A; M49 = S, pyogenes M49 protein; 
IgA-BP - streptococcal IgA-binding protein; Protein G = streptococcal protein 
G; Fn-BP = staphylococcal fibronectin-binding protein; T6 = streptococcal T6 
protein; Pac = 5. mutans surface protein; Wg2 = 5. cremoris surface protease). 
Preceding the hydrophobic domain is a proline/glycine-rich region which has 
been predicted to span the peptidoglycan cell wall in a beta-sheet-like 
conformation. Within this region, all of the surface proteins examined have a 
nearly 100% conserved hexapeptide with the consensus LP(X)TG(X), where X is 
usually a Thr or Ser and sometimes a Gly, Lys, or Asn (Figure 1). The 
conservation of the C-terminal sequence elements (i.e., LP(X)TG(X) motif, 
15 hydrophobic membrane spanning region and charged tail) suggests that the 

process of sorting and anchoring polypeptides to the bacterial cell wall is shared 
by many gram-positive species (Fischetti et al , Mol Microbiol, 4:1603-1605 
(1990); Fischetti et al , Curr. Opinion Biotechnol, 4:603-610 (1993)). 

The functional importance of the carboxy-terminal sorting elements 
20 identified by sequence alignment analysis has been confirmed using protein A 
(Moks et al, Eur. J. Biochem., 156:637-643 (1986)), a well-charactenzed 
surface protein of 5. aureus, as a model system. Protein A, a single polypeptide 
chain, contains two major functional domains. The N-terminal domain contains 
a 36 amino acid signal peptide sequence followed by several repeated sequence 
25 modules (E, D, A, B, and C) that have immunoglobulin binding activity. The 
C-terminal domain contains conserved sorting signals which are thought to be 
responsible for anchoring protein A to the cell wall (Guss et al , Eur. J. 
Biochem., 138:413-420 (1984); Guss et al. , Eur. J. Biochem, , 143:685 (1984); 
Fischetti et al. Mol Microbiol. 4:1603-1605 (1990)). Usmg deletion analysis, 
30 Schneewmd et al {Cell 70:267-281 (1992)) confirmed that the proper sorting of 
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protein A in S. aureus requires all three of the conser\'ed sequence elements 
(i.e., -LPETGE-motif, C-terminal hydrophobic domain, and charged tail). Based 
on their results, Schneewind et al. {Cell, 70:267-281 (1992)) have proposed the 
following sequence of events in the anchoring of protein A and presumably many 
other gram-positive proteins to the cell surface. First, the N-terminal signal 
sequence of protein A directs the polypeptide into the general secretion pathway 
(GSP) (for reviews, see Pugsley, Microbio. Reviews, 57:50-108 (1993); Salmond 
and Reeves, TIBS 18:7-12 (1993)). During the process of membrane 
translocation, the main function of the C-terminal charged tail is to prevent 
secretion of the protein into the medium. Translocation of protein A across the 
membrane presumably results in the recognition of the C-terminal sorting signal. 
Following recognition, the LPETGE (LPXTGX) motif is cleaved specifically 
between the threonine (T) and glycine (G) residues and the resulting N-terminal 
protein fragment is then covalently linked to the S. aureus cell wall (Navarre and 
Schneewind, Mo!. Microbiol., 14:115-121 (1994)). 

Because the anchor motif of gram-positive surface proteins is conserved 
among a wide variety of molecules and several different gram-positive species, a 
logical question to ask is whether it would be possible to anchor a well-known 
surface protein to the cell wall of a heterologous gram-positive host. Pozzi et al. 
(Res. Microbiol., 143:449-457 (1992)), using the fibrillar streptococcal M 
protein, were the first to address the question of protein anchoring in 
heterologous gram-positive hosts. Structurally, the M protein consists of an 
extended central alpha-helical coiled-coil rod flanked by functional end domains 
(Figure 2) (Phillips et al, Proc. Natl. Acad. Sci. (USA), 78:4689-4693 (1981); 
Fischetti, Clin. Microbiol. Reviews, 2:285-314 (1989))(Figure 2, (A) Structural 
representation of the M6 protein; (B) Linear representation of the M6 protem 
molecule. The map displays the N-terminal signal peptide (S), several repeated 
domains (A, B, C, and D), the proline/glycine rich region that contains the 
LPSTGE motif (LPXTGX), the C-terminal hydrophobic, potential membrane 
spanning domain (black bar), and charged tail (KRKEEN). Amino and carboxv 
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terminal residues important for export and cell wall soning/anchonng of the M6 
protein are shaded. The anchor region is not drawn to scale.). The N-termmus 
contains a 42-residue signal sequence required for proper processing (Fischetti, 
Clin, Microbiol. Reviews. 2:285-314 (1989); Hollingshead et al . J, Biol Chem, , 
261:1677-1686 (1986)), while the C-terminus, as determined by sequence 
comparisons, contains the conserved residues that are necessary for cell sorting 
and anchonng (Fischetti er ai , Mol Microbiol, 4:1603-1605 (1990); Pancholi 
and Fischetti, 7. Bactenol ^ 170:2618-2624 (1988); Figure 1). After integrauon 
of a promoterless emm-d, 1 gene in the chromosome of the human oral 
commensal 5. gordonii, it was demonstrated that the M6 protein was correctly 
localized to the cell wall of the heterologous host (Pozzi et al , Res. Microbiol , 
143:449-457 (1992)). 

The next logical step was to examine the possibility of exploiting the M6 
sorting signals in order to express recombinant proteins on the surface of a gram- 
15 positive host. The M6 protein has been successfully modified to deliver several 
heterologous antigens, such as sequences derived from the human papilloma virus 
type 16 E7 protein, the immunodominant epitope of HIV-1 gpl20 and the major 
allergen of white-face hornet venom, to the surface of 5. gordonii (Pozzi et al , 
Infect, Immun., 60:1902-1907 (1992); Pozzi et al. Vaccine, 12:1071-1077 
20 (1994); Medaglini et al, Proc, Natl Acad, Sci. (USA), in press (1995)), Gram- 
positive oral commensal bactena expressing recombinant fusion proteins on their 
cell surface have been used to elicit both a mucosal and systemic immune 
response to foreign antigens while colonizing the oropharynx (Fischetti et al , 
Curr. Opinion Biotechnol, 4:603-610 (1993); Medaglmi et al, Proc. Natl 
25 Acad. Sci. fUSAj, m press (1995)), 

The recent results of the M6 experiments are important for two main 
reasons. First, they serve to confirm the hypothesis that the mechanism of 
protein sorting and anchoring is an extremely well conserved, if not universal, 
process m gram-positive bacteria. More importantly, these experiments are 
evidence for proof of the concept that it is possible to exploit cell wall protein 



30 
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sorting signals for the purpose of constructing novel expression systems that 
anchor recombinant proteins to the surface of a gram-positive bacterial host. 
Therefore, in view of the aforementioned deficiencies attendant with prior art 
expression systems, it should be apparent that there still exists a need in the art 
for the construction and use of a unique gram-positive protein expression system. 

SUMMARY OF THE IN\ ENTION 

Accordmgly, a major objective of the present invention is to develop a 
gram-positive bacterial expression system that can be used for the production and 
purification of recombinant proteins. 

It is also the objective of this invention to provide, as a specific example, 
the construction and use of the SPEX system (for Streptococcal protein 
expression), which is based on the streptococcal M6 protein. This system will 
allow one familiar in the art of genetic manipulation and protein biochemistry to 
express and purify recombinant proteins from gram-positive hosts, such as the 
human oral commensal S. gordonii. 

More specifically, the present invention provides a method for expressing 
proteins m gram-positive bacteria, compnsing selecting a heterologous bacterial 
host which is gram-positive, deficient in extracellular protease production, and 
capable of anchoring surface proteins via C-terminal sorting signals; introducing 
into the host a plasmid vector compnsing DNA fragments encoding amino and 
carboxy terminal sorting sequences of a gram-positive cell wall surface protein, 
DNA fragments coding for the desired protein product, and transcriptional and 
translational control sequences that function in the chosen host; expressing a 
fusion protein comprising the desired protein product, an N-terminal signal 
sequence, and a C-terminal sorting sequence wherein the expressed protein is 
anchored to the host^s cell surface; cleaving the anchored protein from the host^s 
cell surface using a protease; and purifying the cleaved protein from the 
supernatant fluid. The present invention also provides an alternative method for 
expressing proteins in a gram-positive host whereby the desired protein product 
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IS secreted into the extracellular space, and for purifying the secreted proteins 
from the supernatant fluid. 

Briefly, the present invention features general methods for the 
construction and use of a gram-positive host/vector system that uses the cell 
sorting signals of surface proteins for targeted attachment to the cell wall. To 
demonstrate the utility of this system, the invention also describes the 
construction of several specialized vectors, designated SPEX, which are based on 
the S. pyogenes M6 protein. These vectors are intended to be used for the 
expression of proteins and peptides in gram-positive bacteria such as the human 
oral commensal, S. gordonii, A variety of recombinant protein products can be 
expressed by selecting the appropnate SPEX cloning vector. Several protein 
configurations are possible such as the presence of a combination of polyhistidme 
tag and protein A IgG binding domains, factor Xa or TEV protease cleavage 
sites, and a cell wall anchoring domain LP(X)TG(X). Each of the clonmg 
15 vectors acts as a shuttle plasmid which permits the initial cloning to be performed 
in E. coli, followed by transformation into a gram-positive host, such as the 
naturally competent 5. gordonii V288 (Challis). Depending on the SPEX vector 
used, recombinant proteins can be purified from an anchored state after cleavage 
with a specific protease or, alternatively, from the culture medium. 
20 With the foregoing and other objectives, advantages and features of the 

invention that will become hereinafter apparent, the nature of the invention may 
be more clearly understood by reference to the foUowmg detailed description of 
the preferred embodiments of the invention and to the appended claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 Figure 1 depicts the alignment of the C-termmal residues from a select 

group of gram-positive surface proteins. 

Figure 2 depicts the functional domains of the S. pyogenes M6 protein. 
Figure 3 depicts the nucleotide sequence of the S. pyogenes emm6 gene 
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and adjacent DNA regions. 

Figure 4 is a diagrammatic representation of the construction of SPEX 
vectors. The pBLUESCRIPT (ApO backbone of each plasmid is not shown. 

Figure 5 illustrates possible SPEX cassettes useful for anchored surface 
production and purification of recombinant proteins. 

Figure 6 illustrates possible SPEX cassettes used for the secretion and 
purification of recombinant proteins. 

Figure 7 depicts some possible 5. gordonii strains for integrated 
expression of M6 fusion proteins. 

Figure 8 depicts integration of the M6 fusions into the S. gordonii 
chromosome. 



DETAILED DESCRIPTION OF TELE PREFERRED EMBODIMENTS 

OF THE INVENTION 

More particularly, the present invention relates to general methods for the 
construction and use of a novel gram-positive expression system. Proteins 
produced by the expression system described herein can be punfied either 
directly from the cell surface or from the supernatant fluid following secretion. 
For example, an expression system based on the well-known M6 protein of 
S. pyogenes is also descnbed. This expression system, designated SPEX (for 
Streptococcal protein expression), uses the human oral commensal Streptococcus 
gordonii as a model host for this group of organisms. 

A. General Method 

The first major factor to be considered when constructing the gram- 
positive expression system is to determine which bacterium to use as a host. To 
be effective in the present invention the bacterial host should be a gram-positive 
bacteria that is capable of anchoring surface proteins via C-terminal sorting 
signals. Such gram-positive bactena include, but are not limited to. the eenera 



wo 96/40943 



PCTrS96/0996 



-11- 

Aerococcus, Bifidobaaenum. Corprococcus . Deinobaaer, Deinococcus. 
Enrerococcus. Gemella. Lactobacillus, Lactococcus, Leuconostoc. 
Marinococcus, Melissococcus, Micrococcus, Pediococcus, Peptococcus, 
Pepiostrepiococcus. Planococcus. Ruminococcus , Saccharococcus , Salinococcus ^ 
Carcina. Staphylococcus, Stomatococcus , Streptococcus, Trichococcus , and 
Vagococcus. To gain the maximum benefit from the present invention, the 
bacterial host should be deficient in extracellular protease production. 

In the present invention the more preferred species to be used as an 
expression host is Streptococcus gordonii. The most preferred strain is a 
protease deficient vanant of V288 (Challis). 

The second major consideration is the selection of the gram-positive cell 
wall protein to be used as the basis for the construction of the expression vectors. 
Nearly all surface molecules of gram-positive bacteria have amino and carboxy 
terminal sorting signals that specifically target them for cell wall attachment 
CFischetti et al,MoL Microbiol, 4:1603-1605 (1990)). The expression system 
used in the pracace of the present invention can be constructed with any gram- 
positive surface protein that contains the appropriate sorting signals. Such 
sorting signals can be readily identified by those skilled in the art. Sources of 
surface proteins include, but are not limited to, such gram-positive genera as 
Aerococcus, Bifidobacterium, Corprococcus , Deinobacter, Deinococcus, 
Enterococcus. Gemella, Lactobacillus, Lactococcus, Leuconostoc^ Marinococcus . 
Melissococcus, Micrococcus, Pediococcus, Peptococcus, Peptosrreptococcus , 
Planococcus, Ruminococcus , Saccharococcus , Salinococcus , Carcina, 
Staphylococcus, Stomatococcus, Streptococcus, Trichococcus, and Vagococcus, 

The more preferred protein to be used in the present invention is the type- 
6 M protein of Streptococcus pyogenes. The structural gene for the type 6-M 
protein from S. pyogenes has been cloned (Scott and Fischetti, Science, 221:758- 
760 (1983)) and the complete nucleotide sequence has been determined 
(Hollingshead er a!.. 7. Bioi Chem. 261:1677-1686 (1986)). This sequence is 
shown in Figure 3. The predicted M6 amino acid residues are shown below the 
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DNA sequence. The putative ribosome binding site fRBSj is boxed and the 
signal peptide (-42 to -1) is underlined. The mature polypeptide represents 
residues +1 to 441. The proline/glycine rich cell wall domain is shown as 
unshaded rectangles. The LPXTGX (LPSTGE) motif is shown in bold letters. 
The adjacent lightly shaded rectangle represents the hydrophobic, membrane 
spanning domain and the darkly shaded rectangle represents the C-terminal 
charged tail. Relevant restriction sites are shown above the DNA sequence. A 
transcnptional terminator is shown as the inverted repeat structure denoted by the 
arrows. 

The next consideration is to isolate and characterize the DNA sequence 
that codes for the cell waU protein in the chosen host. Methods used for the 
construction and screenmg of genomic libraries (i.e., plasmid, cosmid, phage or 
YAC) to isolate the appropriate protein coding sequence and techniques for 
determining its nucleotide sequence are known to those skilled in the art of 
genetic engineering. See, e.g., Sambrook, Fritsch & Maniatis, Molecular 
Cloning: A Laboratory Manual, Second Edition (1989); Maniatis, Fritsch, & 
Sambrook, Molecular Cloning: A Laboratory Manual (1982) and Griffin & 
Griffin, PCR Technology: Current Innovations (1994). 

The DNA fragments that encode the amino and carboxy terminal sorting 
signals of the surface protein can be isolated using recombinant DNA techniques 
familiar to those skilled in the art. These fragments can be assembled on an 
E. coli cloning vector such as pUC18/19 or any other appropriate plasmid 
element. The N-terminal signal should include a complete leader sequence as 
well as a number of amino acid residues from the N-terminus of the mature 
protein. The inclusion of these sequences will insure proper targeting to the 
cellular export machinery and correct processing during the actual translocation 
process. The C-terminal sorting (i.e., anchor) signals should include the 
LPXTGX motif (where X is any amino acid residue), a hydrophobic membrane 
spanning domain and a charged tail. A comparison of such sorting sequences is 
shown in Figure 1 . The darkly shaded sequences on the left side of the 
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alignment represents the LP(X)TG(X) consensus sequence and the proteins were 
aligned from this sequence. The lightly shaded region on the nght side of the 
alignment represents the carboxyl-termmal hydrophobic, potential membrane 
spanning region. Residues shown in bold represent the charged protein tail. 

To facilitate the construction of heterologous fusion proteins, a small 
DNA fragment containing the recognition sequences for several restriction 
endonucleases can be placed between the DNA sequences that encode the ammo 
and carboxy terminal sorting signals. This multiple cloning site (MCS) will 
facilitate the insertion of heterologous protein coding sequences and upon 
expression will generate in-frame protein fusions that are anchored to the gram- 
positive cell wall surface. Alternatively, expression vectors can also be 
constructed that lack the C-terminal anchor signals. In this case, the protein 
fusions will not be targeted for anchoring but, instead, will be secreted into the 
extracellular space. 

The vectors described in the present invention will allow one familiar in 
the art to produce recombinant proteins that are either anchored at the cell 
surface or secreted into the extracellular space. Regardless of the final 
destination of the target protein, all of the vectors contain the M6 nbosome 
binding sue (RBS), translation initiation codon (ATG) and the 42 amino acid 
leader sequence (-42 to -1) (Figure 3, 4 and 5). In addition, all of the vectors 
contain some portion of the mature N- terminus of the M6 protein. Previous 
studies have demonstrated that as many as 122 amino acids of the mature M 
protein N-terminus are also required for successful translocation. However, 
more recently, it has been shown that as few as 5 N-terminal ammo acid residues 
of the mature protein may be required for proper processing and translocation of 
a M6 fusion protein to the cell surface (Fischetti et al , Curr, Opinion 
Biotechnol. 4:603-610 (1993)). The preferred number of M6 amino terminal 
residues to be used in the practice of the present invention is 58 and include 
residues -42 to -^16 (Figure 3). 

The preferred C-termmal sorting signals used in the present invention are 
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aJso denved from the M protein of type-6 S. pyogenes. The vectors designed for 
targeted protein anchoring contain the M6 C-terminal residues 302 to 441, 
termination codon (TAA) and downstream sequences, which include a putative 
transcriptional terminator (Figure 3). Vectors that are designed for protein 
secretion do not contain the M6 C-terminal sorting signals. 

The expression vector systems of the present invention are suitable for 
overproducing and purifying any desired protein. At the ver\ least, the 
expression vector systems of the present invention are useful for overproducing 
and purifying recombinant proteins or peptides. These proteins or peptides may 
be used for any purpose; for example, they may be used to generate diagnostic 
and vaccine antigens and therapeutic proteins, such as hormones and growth 
factors, and the like. 

To facilitate the release of anchored fusion proteins from the gram- 
positive cell wall, cleavage sites for various proteolytic enzymes can be 
engineered into the expression vectors. These unique sequences can optionally 
be incorporated into the vectors so that they reside either immediately 
downstream of the N-terminal signal peptide or immediately upstream of the C- 
terminal sorting signal domain. It is also possible to engineer cleavage sites at 
both locaUons. Proteases that may be suitable include, but are not limited to, the 
proteinase of tobacco etch virus (TEV), thrombin, enterokinase, and factor Xa. 

A variably sized protein spacer or linker region that physically displaces 
the fusion protein away from the cell surface may also be incorporated into the 
expression vectors. It is possible that in the absence of these spacers the 
efficiency of protease cleavage may be reduced due to steric hinderance with the 
cell wall surface. These linker sequences can be incorporated between the 
protease recognition sequence and the C-terminal sorting signal domain. 

Any unique protease cleavage site can be incorporated mto the vectors of 
the present invention. Preferably, the protease cleavage site will be greater than 
4 amino acid residues. The most preferred protease for use with the present 
invention is the TEV NIa proteinase. The TEV NIa proteinase cleaves a specific 
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consensus cleavage sue which spans the seven ammo acid sequence E-X-V/I/L- 
Y'X-Q*S/G (X can be any ammo acid residue; Doughten* et al.^ EMBO 7., 
7:1281-1287 (1988)). The preferred cleavage site for the present invention is E- 
N-L-Y-F-Q'^G (Parks et al, Anal. Biochem, 216:413-417 (1994)). TEV 
cleavage sites can be engineered anywhere along the fusion construct; however, 
the preferred location is immediately adjacent to the proline/glycine rich region 
harboring the LPXTGX motif (see Figure 5: RBS = M6 ribosome binding site; 
ATG = imtiauon codon; MCS = multiple cloning site (MCS); black bar = 
membrane spanning domain; closed triangles = engineered cleavage site for 
TEV (tobacco etch virus) NIa protease; open triangle = cleavage site for factor 
Xa (lEGR/); small vertical arrow 16 = putative cellular signal peptidase 
cleavage site; see SPEXlb and SPEXlc). The addition of a protein spacer region 
between the TEV recognition sequence and the glycine/proline-rich anchor region 
may improve TEV NIa proteinase efficiency. The preferred spacer for the 
15 present invention is derived from the M6 protein and contains amino acid 
residues 222 to 301 (Figure 5; SPEXld). 

Following protease treatment, the released proteins can be purified in a 
variety of ways known in the art. Suitable purification method include, but are 
not limited to, dialysis, ultrafiltration, zonal centrifugation, molecular-exclusion 
20 chromatography, isoelectric precipitation, solvent fractionation, electrophoretic 
separation, thermal precipitation, ion-exchange chromatography, affinity 
chromatography, and the like. A single step purification using affinity tags is 
preferred. These affinity tags, like the protease cleavage sites, can be optionally 
engineered at either the amino or carboxy terminal regions of the fusion protein. 
25 Useful affinity tags include, but are not limited to, a polyhistidine tract 

(HHHHHH), the IgG binding domain of protein A and glutathione S-transferase 
(GST). Recombmant proteins released from the surface of a gram-positive host 
can be easily punfied in a one step process using metal chelation (e.g., Ni- 
agarose), protein A-Sepharose, and glutathione-Sepharose column 
30 chromatography. N-termmal signal and affinity tag sequences can be easily 
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removed by incorporating a protease cleavage site that is different from the 
recogmuon sequence used to remove the protein from the cell surface. 

The preferred affinity tag for the present invention is a consecutive stretch 
of 6 to 10 histidine residues (HHHHHH). A polyhistidme tag of six amino acid 
5 residues has been shown to be poorly immunogenic and rarely affects protein 

function and structure. The polyhistidine affinity tag can be engineered at either 
the amino or carboxy terminus of the protein. For the present invention, the 
preferred site is immediately downstream of the N-termmal M6 sequences 
(Figure 5). 

10 The N- terminal M6 and affinity tag sequences can be removed from the 

purified proteins by engineering in a second protease cleavage site. The 
preferred enzyme for the present invention is factor Xa. The preferred location 
is immediately downstream of the affinity tag (Figure 5). 

In order to efficiently express the recombinant gene which encodes the 
15 fusion protein, regulatory sequences should preferably be included that assure 
adequate transcription and translation. These vectors should contain 
transcripuonal (i.e., promoters) and translational (i.e., ribosome binding site) 
control sequences that function in the chosen gram-positive host. 

A number of promoters may optionally be used in the practice of the 
20 present invention. These regulatory sequences, which are known to those skilled 
in the an, include heterologous promoters from both gram-negative and gram- 
positive bactena. These regulatory sequences can cause the expression of a gene 
to be turned on or off in response to a chemical or physical stimulus. 
Alternatively, constitutively expressed promoter elements can also be used. 
25 Gene dosage is a variable that has been shown to possibly have an effect 

on protein production in several bacterial expression systems. In general, genes 
present on multiple copy plasmids generate higher levels of protein. The 
plasmids used for the expression of recombinant proteins in the present invention 
should be capable of stably replicating in the chosen gram-positive host. These 
plasmids, which replicate at either a high or low copy number, are familiar to 



30 
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those skilled in the an. 

In the present invention, the extrachromosomal elements used m plasmid- 
linked expression of recombinant genes may optionally contain appropriate 
genetic markers for selection purposes. Suitable genetic markers include various 
5 antibiotic and heavy metal resistance genes as well as nutritional determinants. 
However, other genes that provide a selecdve advantage can also be used. 
Ideally the selective markers should be capable of being expressed in both £. coU 
and gram-positive hosts. This would allow one to carry out the mitial cloning in 
E. coll with subsequent transfer of the recombinant plasmid to a gram-positive 

10 host. Furthermore, in order to facilitate the insertion of heterologous DNA 

fragments, plasmids should also contain an in-frame polylinker or MCS which 
includes useful restriction endonuclease cleavage sites. 

Techniques for introducing recombinant plasmids into nontransformable 
hosts are familiar to those skilled in the art. These methods include, but are not 

15 limited to, electroporation (Dower, in Genetic Engineering-Principles and 
Methods Vol. 12, pp. 275-296 (1990)), chemical transformation, conjugation, 
transduction, and the like. Recombinant DNAs can be easily introduced into 
those gram-positive species that are naturally competent by transformation. 

The presence of certain genes on multiple copy plasmids can occasionally 

20 cause toxicity and, in some cases, cell death. The toxic effects of these gene 
products can many times be reduced by simply decreasing the copy number of 
the determmant to be expressed. This can be readily achieved by stably 
integrating the gene encoding the fusion product directly into the host 
chromosome. The genetic methods that are commonly used to create single copy 

25 prokaryodc expression modules involve homologous, site specific and illegitimate 
(transposition) recombination. 

The preferred method of gene expression for the present invention 
involves integration of the recombinant SPEX construct into the 5. gordonii 
chromosome. 5. gordonii strains that are useful as expression hosts include 

30 GP230 (Pozzi et a!.. Res, Microbiol. 143:449-457 (1992)), GP232 (Pozzi et al . 
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Infeci. Immun., 60:1902-1907 (1992)) and GP251 (Pozzi er al. Vaccine, 
12:1071-1077 (1994); Figure 7: emm-6 is the structural gene for the S. pyogenes 
type 6 M protein. The ermC and cat genes encode erythromycin and 
chloramphenicol resistance, respectively. Thin lines flanking the inserted 
(boxed) sequences represents the S. gordonii chromosome. The integrated 
sequences are located downstream of a strong chromosomal 5. gordomi promoter 
(P)). These strains contain genetic cassettes that have been integrated 
downstream of a strong S. gordonii promoter via homologous recombmation. In 
the present invention, these cassettes provide flanking regions of homology that 
will allow one skilled in the an to replace the sequences present on the GP230, 
GP232 and GP251 with the recombinant SPEX plasmids constructed in vivo. 

The following examples are presented in order to more fuUy illustrate the 
preferred embodiments of the invention. They should in no way be construed, 
however, as limiting the broad scope of the invention. 

EXAMPLE 1: Construction of SPEX plasmids 
A. Anchored vectors 

Plasmid pVMB20 (Pozzi et al. Infect. Immun. 60:1902-1907 (1992)) was 
constructed by subcloning a 3.4 kb Clal fragment, which contains a promoterless 
emm-6 gene and a gene encoding erythromycin resistance {ermC: Honnouchi and 
Weisblum., J. Bacteriol. 150:804-814 (1982)), from pVMB3 (Pozzi et al.. Res. 
Microbiol. 143:449-457 (1992)) into the plasmid pBLUESCRIPT. A 545 bp 
interior region of the emm-6 gene was removed by Kpnl and HinDUl digestion 
and replaced with a small DNA fragment harboring a synthetic multiple cloning 
site (MCS). The resulting plasmid, designated pSMB55 (Figure 4), contains 
sequences encoding the first 122 N-terminal and the last 140 C-terminal amino 
acids of the M6 protein. DNA fragments encoding heterologous protein 
sequences are inserted into the MCS of pSPEXla or pSPEX2a to form m-frame 
fusions with the M6 N-terminal and/or C-terminal sequences. 
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The Sacl-Hpal fragment from witiiin the ermC determinant of pSMB55 was 
removed and replaced with a fragment harboring the aphlll gene encoding 
kanamycm resistance (Tneu-Cuot and Courvalm, Gene, 23:331-341 (1983)). 
The resulting plasmid was designated pSMB113 (Figure 4). 
5 Plasmid pSMB104 carries the emm-6^^^ gene encoding for M6^icm. a 

protein contaimng the first 16 N-terminal and the last 220 C-terminal amino acids 
(Pozzi, G., unpublished data). The Clal-Kpnl fragment of pSMB104 was used to 
replace the existing Clal-Kpnl fragment of pSMB113. The resulting plasmid 
vector, designated pSPEXla (Figure 4), can be used to construct protein fusions 
10 that contain the N-termmal 58 and C-terminal 140 amino acids of the M6 

protein. These protein fusions, when expressed in an appropriate gram-positive 
host, will be anchored to the bacterial cell surface (Figure 5). 

B. Secretion vectors 

Deletion of the HinDIH-Sacl fragment of pSPEXla effectively removes 

15 the C-terminal sorting signals of the M6 protein. The resulting plasmid, 
designated pSPEX2a, retains the M6 N-terminal 58 amino acid residues 
necessary for protein translocation and processing (Figure 4). As a result, 
protein fusions generated with pSPEX2a will be secreted into the growth 
medium. TEV proteinase cleavage sites and polyhistidine tags can be 

20 incorporated to facilitate the removal of non-essential protein sequences and for 
purification purposes, respectively (Figure 6: RBS = M6 ribosome binding site; 
ATG = initiation codon; Term = termination codon; triangle = engineered 
cleavage site for TEV (tobacco etch virus) NIa proteinase; shaded residues = 
important for proper processing and secretion of M6; small vertical arrow = 

25 putative cellular signal peptidase cleavage site). 
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EXAMPLE 2. Expression of recombinant genes in 5. gordonii 

In the present invention, recombinant genes encoding M6 fusion proteins 
can be expressed from multicopy plasmids. Since the preferred expression host 
in the present invention is S. gordonii, plasmids containing an origin of 
replication {oriV) that functions in a vanety of streptococcal species can be used 
(Horodmceanu et al., Antimicrob. Agents Chemother., 10:795-801 (1976); 
Clewell et aL, 7. Bacteriol. 117:283-289 (1974); Behnke and Ferretti, Plasmid, 
4:130-138 (1980); Macrina et aL, Infect. Immun. 28:692-699 (1980); Macrma et 
al. Gene 19:345-353 (1982)). Alternatively, recombinant genes can be 
expressed m the form of single copies which have been stably inserted into the 
5. gordonii chromosome. 

The recombinant plasmid is naturally linearized during transformation 
(Pozzi et al. Res. Microbiol.. 141:659-670 (1990); Pozzi et al. Infect, Immun., 
60:1902-1907 (1992)) and recognition of the flanking homologous segments 
facilitates integration of the M6 gene fusion, together with the aphlll gene 
(Figure 8: the cat gene present in the 5. gordonii recipient strain GP251 confers 
chloramphenicol resistance). Recombinant 5. gordonii are selected for 
kanamycin resistance (conferred by aphlll) using the "multilayer" plating 
technique as previously described by Pozzi et al (FEMS Microbiol. Lett. , 48: 
189-194 (1987)), Kanamycin-resistant transformants are then scored for loss of 
either erythromycin (conferred by ermQ or chloramphenicol (conferred by cat) 
resistance (depending on the recipient). Antibiotics are used at the followina 
concentrations: erythromycin, 5 ^g/ml, chloramphenicol, 5 fig/ml, and 
kanamycin, 500 fig/ml. Southern blot (Southern, J. MoL Biol, 98:503 (1975); 
Southern, Methods EnzymoL, 69:152 (1980)) or PGR analysis of purified 
genomic DNA of Km'(Cm' or Em') transformants can be used to establish the 
structure and copy number of the expression cassette in the bacterial 
chromosome. 

Production of the recombinant protein can be determined by Western blot 
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analysis of whole cell extracts (Towbm ei al. , Proc. Natl Acad. Scl OJSA;, 
76:4350 (1979); Towbin and Gordon, J. Immunol, 72:313 (1984)). Surface 
expression of the M6 fusion protein can be tested by immunofluorescence 
accordmg to the procedure described previously (Pozzi ei al. Vaccine, 12:1071- 
1077 (1994)). Recombinant proteins can be detected using polyclonal antisera 
against the entire M6 molecule, monoclonal antibodies directed against specific 
M6 epitopes (Jones et al , J. Exp. Med,, 164:1226-1238 (1986); Jones and 
Fischetti, 7. Rxp. Med,, 167:1114-1123 (1988)) or antisera directed against the 
heterologous protein expressed as a part of the fusion. 

EXAMPLE 3. Purification of recombinant proteins 
from 5. gordonii 

Recombinant S, gordonii are grown in an appropriate liquid medium such 
as Todd Hewitt- Yeast Extract Broth (THYEB), brain heart infusion (BHI) or 
trypticase soy broth (TSB) containing kanamycin at 37°C for 24 hours or until 
stationary phase growth is achieved. The bacterial culture is centrifuged and 
depending on the location of the recombinant protein (cell-wall anchored or 
secreted) the cell pellet or the spent growth medium is retained for further 
analysis. 

S. gordonii cells harboring surface expressed proteins are collected, 
washed and resuspended in a minimal volume of PBS (150 mM NaCl, 16 mM 
Na.HPO^, 4mM NaH2P04 pH 7.3), in the presence of proteinase inhibitors. 

Recombinant proteins with engineered protease cleavage sites are removed 
from the cell surface by the addition of TEV NIa proteinase. Following 
incubation for a suitable period of time at 25-30°C, the cells are removed by 
centrifugation and the supernatant fluid, which is ennched for the recombinant 
protein, is collected. Those fusion proteins contaimng tandem histidine residues 
can be further purified by passing the supernatant over a nickel-chelating resin 
(ProBond, Invitrogen Corp.) or nickel nitrilotriacetic acid [Ni-NTA] agarose 
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(Qiagen Inc.). Following washing, the recombinant protein is then eluted from 
the column using an imidizole gradient under mild conditions (i.e., pH 6.0). The 
amount of purified protein can be measured by the method of Bradford 
(Bradford, ArmL Biochem., 72:248-254 (1976)). 
5 The use of the SPEX2 series of vectors (Figure 6) generates fusion 

proteins that are secreted into the surrounding medium during growth. 
Following removal of the bacterial cells, the spent growth medium is retained 
and used as the starting material for purification of the desired product. 
Recombinant proteins can be purified using metal chelation chromatography, as 
10 described above, and non-essential N-terminal residues can be removed with 
TEV NIa proteinase treatment. 



While the invention has been described and illustrated herein by 
references to various specific materials, procedures, and examples, it is 
understood that the invention is not restricted to the particular combinations of 
15 materials and procedures selected for that purpose. Numerous variations of such 
details can be implied and will be appreciated by those skilled in the art. 
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WHAT IS CLAIMED IS: 

1. A method for expressing a desired protein in gram-positive bacteria, 
comprising: 

selecting a heterologous bacterial host which is 
5 gram-positive, and 

capable of anchoring surface proteins via C-terminal sorting signals; 
introducing into the host a plasmid vector comprising 
DNA fragments encoding 

amino and carboxy terminal sorting sequences of a gram-positive 
10 cell wall surface protein and 

the desired protein, and 
transcriptional and translational control sequences that function in the 
chosen host; 
expressing a fusion protein comprising 
15 the desired protein, 

an N-terminal signal sequence, and 
a C-terminal sorting sequence 

wherein the expressed fusion protein is anchored to the host's cell 
surface; 

20 cleavmg the anchored protein from the host's cell surface using a 

protease; and 
recovering and purifying the cleaved protein. 



2. The method of claim 1 wherein the host is selected from the genus 

Aerococcus. Bifidobacterium, Corprococcus, Deinobacter, Deinococcus , 
25 Emerococcus , Gemella, Lactobacillus, Lactococcus, Leuconostoc, 

MarinococciLs . Melissococcus, Micrococcus, Pediococcus, Peptococcus, 
Peptostreprococcus , Planococcus. Ruminococcus , Saccharococcus . 
SaliJiococcus , Carcina, Staphylococcus, Stomatococcus , Streptococcus. 
Tnchococcus. or Vagococcus. 
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3. The method of claim 1 wherein the host is from, the genus Streptococcus. 

4. The method of claim 1 wherein the host is Streptococcus gordonii. 

5. The method of claim 1 wherein the host is Streptococcus gordonii V288 
(Chalhs), GP230, GP232, or GP251. 

6. The method of claim 1 wherein the host is deficient m extracellular protease 
production. 

7. The method of claim 1 wherein the gram-positive cell wall surface protein is 
derived from the g&nus Aerococcus , Bifidobacterium, Corprococcus , 
Deinohacter, Deinococcus, Eruerococcus , Gemella, Lactobacillus, 
Lactococcus, Leuconostoc, Marinococcus , Melissococcus, Micrococcus, 
Pediococcus, Peptococcus, Pepiostreptococcus , Planococcus , Ruminococcus , 
Saccharococcus. Salinococcus , Carcina, Staphylococcus, Stomatococcus , 
Streptococcus, Trichococcus, or Vagococcus. 



8. The method of claim 7 wherein the gram-positive cell wall surface protein 
the type-6 M protein of Streptococcus pyogenes. 



is 



9. The method of claim 7 wherein the gram-positive cell wall surface protein 
comprises: 

an N-terminai signal sequence comprising a complete leader sequence 
and a number of amino acid residues from the N-terminus of the mature 
protein; 

a C-terminal sorting signal comprising the LPXTGX motif, wherein X is 
any amino acid residue, residing within a proline/glycine nch region; 
a hydrophobic membrane spanning domain: and 
a charged tail. 
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10. The method of claim 9, wherein the gram-posiuve cell wall surface protein 
additionally comprises a DNA fragment having a recognition sequence for at 
least one restriction endonuclease (the multiple cloning site, MCS) located 
between the DNA sequences that encode the amino- and carboxy- terminal 
sorting signals. 

11. The method of claim 1 wherein the plasmid vector further comprises at least 
one cleavage site for a proteolytic enzyme. 

12. The method of claim 1 wherein the protease is the proteinase of tobacco etch 
virus (TEV), thrombin, enterokinase, or factor Xa. 

13. The method of claim 9 wherein the plasmid vector additionally comprises a 
variably sized protein spacer or linker sequence that physically displaces the 
fusion protein away from the cell surface. 

14. The method of claim 13 wherein the linker sequence is incorporated between 
the protease recognition sequence and the C-terminal sorting signal domain. 

15. The method of claim 1 wherein the cleaved proteins are purified in a single 
step through the use of affmity tags. 

16. The method of claim 15 wherein the affinity tags are polyhistidme tracts 
containing from 6 to 10 histidine residues, the IgG binding domain of protein 
A, or glutathione S -transferase. 

17. The method of claim 1 wherein the cleaved proteins are purified using metal 
chelation, protein A-Sepharose, glutathione-Sepharose column 
chromatography, or a combination thereof. 
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18. The method of claim 1 wherein the plasmid vector additionally comprises 
extrachromosomal genetic markers that are sufficient to provide a selective 
advantage to a host expressing said genetic markers. 

19. The method of claim 18 wherein the genetic markers are antibiotic genes, 
heavy metal resistance genes, and nutritional determinants. 

20. The method of claim 18 wherein the genetic markers are capable of being 
expressed in both E. coli and gram-positive hosts. 

21. The method of claim 18 wherein the plasmid vector additionally comprises a 
polylinker or multiple cloning site which includes restriction endonuclease 
cleavage sites. 

22. The method of claim 7 wherein the plasmid vector comprises sequences 
coding for the M6 ribosome binding site, the translation initiation codon 
(ATG), the 42 amino acid leader sequence (-42 to -1), and a portion of the 
mature N-terminus which is at least 5 amino acids long. 

23. The method of claim 22 wherein the portion of the N-terminus coded for 
comprises residues -42 to -f 16. 

24. The method of claim 1 wherein the C-terminal sorting signal is derived from 
the M protein of type-6 S. pyogenes. 

25. The method of claim 1 wherein the plasmid vector additionally comprises the 
M6 C-terminal residues 302 to 441, termination codon (TAA), and 
downstream sequences which comprise a transcriptional terminator. 



26. The method of claim 1, wherein the plasmid vector is pSPEXla. 
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27. The method of ciaim 26, wherein the pSPEXla vector additionally compnses 
a protease which has a cleavage site of greater than 4 amino acid residues. 

28. The method of claim 26, wherein the protease is the TEV NIa proteinase, 
which cleaves a specific consensus cleavage site which spans the seven amino 
acid sequence E-X-V/I/L-Y-X-Q*^S/G, wherein X can be any amino acid 
residue. 



29. The method of claim 28, wherein the cleavage site is E-N-L-Y-F-Q*G. 

30. The method of claim 28, wherein the cleavage site is located adjacent to the 
proline/glycine-rich region harboring the LPXTGX motif. 

31. The method of claim 28, wherein a protein spacer is added between the TEV 
proteinase recognition sequence and the prolme/glycine-rich anchor region. 

32. The method of claim 31, wherein the spacer is derived from the M6 protein 
and consists of amino acid residues 222 to 301. 



33. A method for expressing a desired protein in gram-positive bacteria, 
15 comprising: 

selecting a heterologous, gram-positive bacterial host; 
introducing into the host a plasmid vector comprismg 
DNA fragments encoding 

ammo and carboxy terminal sorting sequences of a gram-positive 
20 cell wall surface protein, and 

the desired protein; and 
transcnptional and translational control sequences that function in the 
chosen host; 
expressing a fusion protein comprising 
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the desired protein, and 

an N-terminal signal sequence 

wherein the expressed protein is secreted from the cell; and 
purifying the protein product from the supernatant fluid following 
secretion . 

34. The method of claim 33 wherein the host is selected from the genus 
Aerococcus, Bifidobacterium, Corprococcus, Deinobacrer, Deinococcus . 
Enterococcus, Gemella, Lactobacillus, Lactococcus, Leuconostoc , 
Marinococcus . Melissococcus, Micrococcus, Pediococcus , Peptococcus, 
Peptostreptococcus , Planococcus . Ruminococcus , Saccharococcus , 
Salinococcus , Carcina, Staphylococcus, Stomatococcus , Streptococcus , 
Trichococcus , or Vagococcus, 

35. The method of claim 33 wherein the host is from the genus Streptococcus, 

36. The method of claim 33 wherein the host is Streptococcus gordonii. 

37. The method of claim 33 wherein the host is Streptococcus gordonii V288 
(Challis), GP230, GP232, or GP25L 

38- The method of claim 33 wherein the host is deficient in extracellular protease 
production. 

39. The method of claim 33 wherein the gram-positive cell wall surface protein 
is derived from the genus Aerococcus, Bifidobacterium, Corprococcus , 
Deinobacter, Deinococcus, Eruerococcus , Gemella, Lactobacillus, 
Lactococcus, Leuconostoc , Marinococcus , Melissococcus , Micrococcus, 
Pediococcus, Peptococcus, Peptostreptococcus , Planococcus, Ruminococcus , 
Saccharococcus , Salinococcus, Carcina, Staphylococcus, Stomatococcus , 
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Streptococcus , Trichococcus . or Vagococcus. 

40. The method of claim 39 wherein the gram-positive cell wall surface protem 
is the type-6 M protem of Streptococcus pyogenes, 

41. The method of claim 39 wherein the gram-positive cell wall surface protein 
5 comprises: 

an N-terminal signal sequence composing a complete leader sequence 
and a number of amino acid residues from the N-terminus of the mature 
protein; 

a C-terminal sorting signal comprising the LPXTGX motif, wherein X is 
10 any amino acid residue, residing within a proline/glycine rich region; 

a hydrophobic membrane spanning domain; and 
a charged tail. 

42. The method of claim 41, wherein the plasmid vector additionally composes a 
DNA fragment having a recognition sequence for at least one restriction 

15 endonuclease (the multiple cloning site, MCS) located between the DNA 

sequences that encode the amino- and carboxy- terminal sorting signals. 

43. The method of claim 41 wherein the expression vector additionally comprises 
a variably sized protein spacer or linker sequence that physically displaces 
the fusion protein away from the cell surface. 

20 44. The method of claim 43 wherein the linker sequence is incorporated between 
the protease recognition sequence and the C-terminal sorting signal domain. 

45. The method of claim 33 wherein the plasmid vector additionally comprises 
extrachromosomal genetic markers that are sufficient to provide a selective 
advantage to a host expressing said genetic markers. 



wo 96/40943 



PCT/L'S96/0996 



-30- 

46. The method of claim 45 wherem the genetic markers are antibiotic genes, 
heavy metal resistance genes, and nutritional determinants. 

47. The method of claim 45 wherein the genetic markers are capable of being 
expressed in both E. coli and gram-positive hosts. 

48. The method of claim 45 wherein the plasmid vector additionally compnses a 
polylinker or multiple cloning site containing restriction endonuclease 
cleavage sites. 

49. The method of claim 39 wherein the plasmid vector comprises sequences 
coding for the M6 ribosome binding site, the translation initiation codon 
(ATG), the 42 amino acid leader sequence (-42 to -1), and a portion of the 
mature N-terminus which is at least 5 amino acids long. 

50. The method of claim 49 wherein the portion of the N-terminus coded for 
comprises residues -42 to +16. 

51. The method of claim 33 wherein the vector is one of the SPEX2 series of 
vectors. 

52. The method of claim 33 wherein recombinant proteins are purified using 
metal chelation chromatography. 

53. The method of claim 33 wherein non-essential N-terminal residues can be 
removed with TEV NIa proteinase treatment. 
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